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Abstract 

IT) ' 

We apply a promising new method from the field of representations of Lie groups to calculate integrals over 
unitary groups, which are important for multi-antenna communications. To demonstrate the power and simplicity 
of this technique, we first re-derive a number of results that have been used recently in the community of wireless 
information theory, using only a few simple steps. In particular, we derive the joint probability distribution of 
eigenvalues of the matrix GG^, with G a semicorrelated Gaussian random matrix or a Gaussian random matrix 
with a non-zero mean. These joint probability distribution functions can then be used to calculate the moment 
f*"*". ' generating function of the mutual information for Gaussian MIMO channels with these probability distribution of 

their channel matrices G. We then turn to the previously unsolved problem of calculating the moment generating 
function of the mutual information of MIMO channels, which are correlated at both the receiver and the transmitter. 
, From this moment generating function we obtain the ergodic average of the mutual information and study the outage 

HH probability. These methods can be applied to a number of other problems. As a particular example, we examine 

c/3 , unitary encoded space-time transmission of MIMO systems and we derive the received signal distribution when the 

i ^ i ' channel matrix is correlated at the transmitter end. 

Index Terms 

> '. 

Capacity, Space-Time Coding, Multiple Antennas, Unitary Space-Time coding, MIMO, Random Matrix Theory, 
Character expansions. 

o\ : 

I. Introduction 

in ■ 

O ' TN recent years, a flurry of research has resulted from the prediction [1], [2] that the use of multiple antennas 
c/2 . M. to transmit and receive signals leads to substantial increases in information throughput. In an effort to quantify 
Q \ these throughput gains, several authors have analyzed the asymptotic behavior of the multiple-input-multiple- 
^ ■ output (MIMO) mutual information for large antenna numbers. In [3], [4] the asymptotic ergodic capacity (mutual 
^ , information averaged over channel realizations) was evaluated in closed form for Gaussian i.i.d. channels. In [5]- 
[8] the assumption of i.i.d. channels was lifted and the effects of spatial correlations were included. In addition, 
asymptotic results on the variance of the mutual information were derived, [9]— [12], which together with its mean 
were shown [9] to describe the full distribution very accurately even for small antenna numbers. 

More recently, several exact results regarding the distribution of mutual information for the MIMO link have 
appeared in the literature. Using different methods these prior works calculate the moment generating function of 
the distribution of the mutual information for various assumptions about the statistics of the channel matrix. From 
the generating function, the moments can be generated by direct differentiation, or the probability of outage [13] 
can be derived through a simple numerical integral. Specifically, [14] first extracted the moment-generating function 
for Gaussian uncorrelated channels. Furthermore, in [15], [16] Rician channels were used. In addition, [17], [18] 
treated the case of semi-correlated channels, where either the transmitting or receiving antennas are correlated (but 
not both). In particular, [17] dealt with the case where the number of correlated antennas is less than or equal to 
the number of uncorrelated antennas, while [18] dealt with the opposite case. All above progress in the calculation 
of the moment generating function relied essentially on previously known results in the theory of Wishart matrices 
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(matrices of the form GG^), which allowed the calculation of the joint probability distribution of eigenvalues of 
such matrices. 

Interestingly, one can formulate the problem of calculating the joint probability distribution of eigenvalues of 
GG^ in terms of integrals over the unitary group. Such integrals have been analyzed since the 1980's in the field 
of high energy physics [19], [20] and even earlier in the mathematics community [21]. More recently, [22] recast 
the same problem from an elegant group-theoretic point of view, which allows one to perform such integrals in a 
few simple steps. In this paper we will apply these new group-theoretic methods to calculate the joint distribution 
of eigenvalues of GG^, for a number of different channel distributions, p(G). From the eigenvalue distribution 
the moment generating function of the mutual information is readily obtained. We first simply re-derive a few 
of the above mentioned prior results [15], [17], [18] and then we obtain a new result of the moment generating 
function for zero-mean Gaussian channels with correlations in both (transmitting and receiving) ends of the link. 1 
In addition, we discuss in an appendix how similar techniques can be powerful in analyzing unitary space-time 
encoded transmission in MIMO systems. [24]-[27]. 

The techniques we use are both simple and powerful, and a main aim of this paper is to expose the reader to 
these methods. Although we cannot attempt to give a complete exposition of the field of group-theory, we are able 
to give a small and simple set of rules, definitions, and theorems that will allow the interested reader to apply these 
techniques to other problems of interest. 

A. Outline 

We start with some notational definitions and the introduction of the system model in Section |HJ In Section 
ITTll we introduce the reader to some basic definitions in group theory and quote the basic results of the method, 
which will be applied in the remainder of the paper. We illustrate the main steps of this method by calculating the 
known joint probability distribution of the eigenvalues of GG^ for the semicorrelated Gaussian MIMO channel G 
in Section HV-Bl which involves a single unitary integral. In Section ITV-CI we generalize our method to derive the 
joint distribution of eigenvalues of GG^ for G being a non-zero mean Gaussian MIMO channel. Although also a 
known result, this analysis gives us further insight in the utility of our method, since this case requires a double 
unitary integration. 

In Sections IIV-DI and [V] we apply the technique to the case the zero-mean Gaussian channel with correlated 
transmitting and receiving antennas and derive a closed-form expression for the moment generating function for the 
corresponding mutual information, which is a new result. We use this formula in Section |2] to obtain an analytical 
expression for the ergodic capacity and also study the outage probability. 

As mentioned above, in this paper we aim to expose the reader to the new group-theoretic methods discussed in 
Section These methods are also applicable in other areas of information theory of MIMO systems. To illustrate 
this point, we apply this method in the context of unitary space-time coding and calculate the received signal 
distribution for the case where the channel is correlated at the transmitter end, a new result. In order not to disrupt 
the flow of the main part, this analysis appears in Appendix fVl 

II. Definitions 

A. Notation 

1) Vectors/Matrices: Throughout this paper we will use bold-faced upper-case letters to denote matrices, e.g. 
X, with elements given by X^. Bold-faced lower-case letters, e.g. x will be used for vectors with elements xi 
and non-bold lower-case letters will denote scalar quantities. Also, the superscripts T and f will indicate transpose 
and Hermitian conjugate operations and I n will represent the re-dimensional identity matrix. Finally, diag(x) will 
denote a diagonal, but not necessarily square matrix with the elements of the vector x on the main diagonal. 

2) Determinants: det(f(i,j)) will denote the determinant of a matrix with the i, jth element given by f(i,j), 
an arbitrary function of i and j. In order to keep the notation consistent, we will always use the indices i,j for 
this purpose in determinants. Furthermore, we will be using the notation A(x) for an re-dimensional Vandermonde 
determinant of the elements of the ra-dimensional vector x. 

A(x) = A([xi, . . . , x n ]) = det^" 1 ) = Y[( Xi - Xj ) (1) 

i>j 

'in [23] we derive a similar result used to calculate the marginal eigenvalue distribution of GG f with such statistics. 
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In addition, due to the extensive use of determinant expansion in the text, it is convenient to define the following 
notation 

M -. M 

det X = J]) sgn(a) J] X iai = — ]T ]T sgn(a) sgn(b) JJ * a<6j (2) 

a i=l iW ' a b i=l 

where X is a M -dimensional matrix. In the above expression the vector a = [a\, a,2, ■ ■ ■ , bm] is a permutation of 
the sequence [1,2,..., M]. sgn(a) is the sign of the permutation a, which is sgn(a) = +1 if [a\,a2, ■ ■ ■ , om| is 
an even permutation of the sequence [1,2,..., M], sgn(a) = — 1 for an odd permutation. It is implied here that if 
a is not a permutation of [1, 2, ... , M], sgn(a) = 0. The summation is over 1 < 04 < M for i = 1, . . . , M. 

3) Integral Measures: We will use three types of integrals: In the first we will be integrating over the real and 
imaginary part of the elements of a complex m rows x m co i s matrix X. The integral measure will be denoted by 

a=l a=l ^ 

We will also be using the notation 

n 

dx = Y[ dxi (4) 

i=l 

for integrals over the positive real axis of the elements of the vector x. Finally, DU will represent the Haar integral 
measure [28] for integrals over the elements of the unitary group U(M) for a specified M. 

4 ) Antenna Numbers: m and n r will denote the number of antennas at the transmitter and receiver, respectively. 
We will also use the following notation for maxima and minima of antennas: M = max(nt, n r ),N = min(n r , nt). 



B. System Model 

We consider the case of single-user transmission from n t transmit antennas to n r receive antennas over a narrow 
band fading channel. The received n r -dimensional complex signal vector y can be written as 

y = Gx + z (5) 

G is a n r x n t complex matrix of the channel coefficients from the transmitting to the receiving arrays, x is 
the nt-dimensional vector of transmitted Gaussian signals, while z is the n r -dimensional additive Gaussian noise 
vector. Without loss of generality, both x and z are taken to be i.i.d., zero-mean with unit-variance. One can 
straightforwardly include arbitrary covariances for both in the end result (e.g. see [9]). Finally, the channel G is 
assumed to be block-fading, i.e. constant over a significant amount of time and then changing completely according 
to its statistics. For concreteness, Gaussian channel statistics of G are assumed. 

During reception, the receiver is assumed to know the instantaneous channel matrix G. Thus, the mutual 
information can be expressed as [1] 



J(y;x|G)=logdet (i^ + G+g) 



(6) 



The log above (and throughout the whole paper) represents the natural logarithm and thus / is expressed in nats. 

We describe the statistics of the channel G in terms of its probability distribution p(G). Due to the underlying 
randomness of G, the mutual information / is also a random quantity. In this paper we will analyze the statistics 
of the mutual information / in (O, assuming that the channel G is a Gaussian random matrix. We will specify 
details of several different types of Gaussian distributions to study below (iid, semicorrelated, nonzero mean, and 
correlated). In order to analyze the statistics of the mutual information /, we will calculate the generating function 
g{z) defined as, 



g(z) = E G 



Eg [{ 



det(I nt + G 



»G)}' 



where the expectation Eg [ ■ ] is with respect to the probability distribution p(G) of the matrix G. 
known, all moments of / can be generated by evaluating the derivatives of g(z) at z = 0. 



E G [I] 



Eg 



(V) 

Once g(z) is 

(8) 
(9) 
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etc. In addition, the probability of outage, P ou t, i-e. the probability that the mutual information is less than a given 
value I out [13], can be readily obtained by performing the integral 

Pout = E G [8 (J - I out )} = / — g(iz) — (10) 



2m 



z ■ 



i0+ 



where we have used the identity Q(x) = J dz/{2m) ex.p(ixz)/(z — i0 + ) for the Heavyside step function 6(x). 

From the form of the mutual information in © and Q, it is clear that the generating function can be written 
very simply in terms of the eigenvalues Aj of the matrix G^G, as 



N 
Li=l 



N 

n 



d\i(l + \i) z P({\j}) 



(11) 



where the product is over the N = min(rtt, n r ) non-zero eigenvalues of G^G and the expectation E^[- ] is defined 
as a multiple integral over the joint distribution P({Aj}) of the eigenvalues {Aj}. Note that P({Aj}) will be different 
for each distribution p(G) we will analyze. 



III. Important Results from Representation Theory of Unitary Groups 

In this section we will briefly introduce a few relevant quantities and a number of important results from the 
theory of representations of unitary groups, which will be used in the paper. The interested reader can refer to 
several textbooks, including [28], [29]. We do not attempt a thorough exposition of the subject of group theory. 
However, we do aim to give a self-contained introduction, so that the interested reader can effectively apply these 
methods to tackle other problems of interest. 

A d-dimensional representation V of a group G is a homomorphism from G into a group of <i-dimensional 
invertible matrices. Such groups include Gl(M), the group of complex invertible matrices of dimension M, and 
U(M) is its subgroup of unitary matrices (i.e., the subgroup of matrices U such that UTJt = I). These groups 
are also called Lie groups, i.e. differentiable manifolds with both multiplication and inverse operations being 
differentiable. 

1) Irreducible Representations: A d-dimensional representation is irreducible if it has no non-trivial invariant 
subspaces. In other words, a representation is irreducible if there exists no <i-dimensional invertible matrix A, such 
that AHA -1 becomes block diagonal for all group elements H. 

The irreducible representations of the unitary group U(M) (and Gl{M)) can both be labelled by an M- 
dimensional vector m = [mi,m2, . . . , tiim], with integers mi > 771,2 > • • • > fnM > 0. [28], [29] 

2 ) Dimension of Irreducible Representations: The dimension d m of an irreducible representation labelled by m 
is the dimension of its invariant subspace. For both cases of U(M) and Gl(M) it can be shown [30] that d m is 
given by 



d r 



M 

n 



{M + rrii 



(M 



det 



1 



(mi - i + 



(12) 



where the matrix elements inside the determinant with mi — i + j < are zero. 

The above form of the dimension d m in terms of a determinant of a matrix of inverse factorials is not particularly 
useful for manipulation. The following lemma makes its form more manageable. Its proof appears in Appendix U 

Lemma 1 (Vandermonde determinant form of d m ): The dimension d m of an irreducible representation m of 
U(M), given in (fL2l can be written as 



d r , 



M 

n 



A(k) 



\{M-i)\ 

where A(-) represents the Vandermonde determinant and the vector k has elements 



(13) 



rrii 



i + M 



(14) 



for i = 1, . . . , M and m 8 are the elements of the representation vector m. 
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3) Character of Representation: The character xio) °f a group element g in the representation V is equal to 
the trace of the corresponding matrix, i.e. x(d) = Tr{y (<?)}. Also, x(d) depends only on the eigenvalues of V(g). 
Calculating the characters of irreducible representations is greatly facilitated by Weyl's character formula [29], [31], 
which for U{M) and Gl{M) takes the form: 

det (a mi+M ~ j ) 

Xm(A) = Tr { A( m ) } = — U 1 (15) 

i- > A(ai, . . . ,a,M) 

where the index m denotes the irreducible representation (mi, . . . , tum) and aj, for i = 1, ...,M, are the 
eigenvalues of A in the fundamental (M-dimensional) representation. Here A in Xm (A) represents a M-dimensional 
matrix corresponding to the group element that is being represented, while represents the d m -dimensional 

matrix, which is the (m)-representation of A. 

For example, the characters of the one-dimensional unitary group U(l) are given by Xn(e^) = e in ^, where 
the character index n takes non-negative integer values and e 1 ^ is an arbitrary element in U{1). Thus a Fourier 
transform 2 can be seen as an expansion in the characters of ?7(1). This suggests that the characters of a group can 
form a good basis for expanding functions, which are invariant under the corresponding group operations. Although 
such expansions are generally complicated, recently [22] showed how it can be done for exponentials of a trace of 
a Gl{M) matrix. His result can be summarized in the following important lemma. 

Lemma 2 (Character Expansion of Exponential): Let A be a member of the general linear group Gl(M) and x 
an arbitrary constant. Then the following equation holds: 

e ,Tr { A} = ^> m(x)Xm(A) (16) 

m 

In the above expression, Xm(A) is the character of A in the representation m and the sum is over all irreducible 
representations of Gl(M) parameterized with the vector m = [mi, m,2, ■ ■ ■ , tum], with integers mi > 771,2 > . . . > 
m-M > 0. In (fT6l the coefficient for each character, a m (x) is given by 



ot m {x) = (x) 



m 1 +m 2 +...+m M 



rr (M-i)\ 



d m (17) 



* (M + mi-i)\ 

where d m is the dimension of the irreducible representation and is given by d 1 2b . 

The importance of this result is that it allows us to expand the exponential in group-invariant quantities as the 
characters Xm- 

4) Orthogonality Relation between Unitary Group Matrix Elements: The integration properties of elements of 
the unitary group in different representations are given in the following (see [28]) 

/ DU Ulfu^* = -^<W<^7 (18) 

where m is the dimension of the representation and DU is the standard Haar integration measure over U(M). 

5) Cauchy-Binet Formula: Finally, we cite a useful result, which will be applied in our calculations below (see 
proof in [29]). 

Lemma 3 (Cauchy-Binet Formula): Given M-dimensional vectors a and b, and a function W(z) = J2i^=o w(i)z l 
convergent for \z\ < p, then, if \aibj\ < p for all we have: 

]T det[a k i '}det[b- 1 }UT=Mk t ) = det[W(a i b j )} (19) 

fel>fe 2 — >fcm>0 

where the determinants are all taken with respect to the indices i and j. 

These few definitions and formulae will be enough to allow us to manipulate integrals over the unitary group in 
extremely powerful ways. 

2 The astute reader will realize that the characters have only nonnegative n, whereas the Fourier transform has all n. This means the Fourier 
transform is actually a character expansion of U and IP, This complication does not enter into our particular use of the character expansion. 
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IV. Calculation of the Joint Distribution of Eigenvalues of GG^ using the Character 

Expansion 

As we saw in the end of Section III-BI one can calculate g(z) by first calculating the joint probability density 
P({Xi}) of the eigenvalues Aj. In this section we will introduce a new method to calculate the joint probability 
density for Gaussian channel matrices G with non-trivial correlations or non-zero mean. As we shall see, this 
method allows us to obtain P({Aj}) with minimal effort, using the results introduced in Section |ffl] Given its 
power and simplicity, we will expose the basic steps of the approach in the main part of the paper, even though 
the results presented in this section have been derived elsewhere, albeit using more involved and less applicable 
methods. 

To express the probability density in terms of the eigenvalues A, of GG^, it is convenient to change integration 
variables, from the elements of G to its singular values, 

IH = V>H = SVD(G) (20) 

for i = 1, . . . , M, using the standard transformation 

G = Udiag{^)\ ] (21) 

where U, V are unitary n t x n t and n r x n r matrices, respectively and diag(n) is a diagonal nt x n r matrix 
with diagonal elements [n, for i = 1, . . . , N. Including the Jacobian of the coordinate transformation we get the 
following relation [2], [32]-[34] 

N 

VG = C M ,ivA(A) 2 Yl Xf I - N d\ DXJ DV (22) 
i=i 

where A is an ^-dimensional vector with elements Aj and A (A) is the corresponding Vandermonde determinant. 
dX is the product of the A, differentials dX = d\\ . . . dXjy, while DXJ and DV are the standard Haar integration 
measures for the corresponding unitary matrices, see for example [28]. The normalization constant Cm,n is given 
by the Laguerre-Selberg integral (see p. 354 in [32]) 

N 

Cm,n = II [ r 0' + !) r ( M ~ N + •?')] ( 23 ) 
i=i 

Thus, we can express P({Aj}) as 

N 

-P({Ai}) = C MtN A(X) 2 J] Af ~ N JDXJJDV p(G = TJdiag(j*)V*) (24) 

i=l 

where the integrals are over all unitary matrices U, V and p(G) is the probability distribution of the matrix G. 

A. Case of i.i.d. channel with zero mean 

When G is independently and identically distributed (i.i.d.) with zero mean, the probability distribution is given 

by 

p(G) = e - Tr { GtG > (25) 

Substituting G = Ucfa'ag(/i)V' we obtain 

p(G) = exp —Tr ^diag(ix)^ diag(ix)^ = exp [— Tr {diag(X})] (26) 

which is independent of both U and V. diag(X) is an N x N diagonal matrix with diagonal elements Aj = /i?, 
which are the eigenvalues of G^G. Thus, the integrals over the unitary group in d24l > are trivial and we immediately 
obtain the well-known result [32] 

N 

P({A,}) = C m ,nA(X) 2 J] [Xf- N e~ x <] (27) 
i=i 

It is clear that this particular case is rather unique in that the unitary integrals in d24l are trivial. More generally, 
the argument of these integrals will contain U and V explicitly, making the joint distribution harder to obtain, as 
we shall see below. 
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B. Case of semi-correlated G with zero mean 

The simplest case of channel with a unitary matrix explicitly in p(G) is that of a semi-correlated channel, i.e. 
one with non-trivial correlations on only one side of G, with the probability distribution taking the form 



p(G) = At exp [-Tr {gT^gHI =AA T exp [-Tr {diag fyXJ^T^Xj] 



(28) 



where the normalization constant is Avj^ 1 = detT nr and diag(X) is an n t -dimensional diagonal matrix with the 
first N diagonal elements Aj = Aj, for i = 1, . . . , N and the remaining M — N ones (if n t = M > N = n r ) taking 
for now arbitrary values. In the end of the calculation we take the limit \n+i, ■ ■ ■ , Xm — ► 0. As a result, only N 
non-zero eigenvalues of GG^ will remain. 

Inserting d28t into (1241 . P({A;}) can be expressed as 



N 

P({\i}) = C M>N M T A(A) 2 n\ M_7V / ^Uexp [-Ti {diagiX^T^Xj} 



(29) 



Step 1: Integration over U. We next need to integrate out the unitary matrix U. Such integrals are well known 
[19], [21], and we could easily skip directly to the result Q7t . However, it is useful to go through this particular 
derivation, so that the reader will become familiar with the technique. Applying Lemma to the exponential of 
(l29l we get 



N 

P({X l }) = Cm, jv At A(A) 2 [] \f~ N J DU ^ a m (-l) Xm (diagCX^T^V 



(30) 



i=l 
N 



= C M>J vA'TA(A) 2 nAf- JV / DUj2*m(-l)TrUag(\) (m) (uW) f (tW)" 1 

i=l J m 

where a m (x) is defined in dT7l and the sum is over all integer vectors m = [mi, . . . , m n J with mi > m,2 > . . . > 
ftim > 0, corresponding to the representations of U (nt) (see (fToll). We remind the reader that we use the notation 
M( m ) to be the group element M in the representation m. The second equation here results from the definition of 
the character Xm as the trace of its argument. As mentioned above the integrand now is quadratic in the unitary 
matrix elements and thus we may integrate over the unitary matrix U using the standard orthogonality relation 
between unitary matrix elements, given in Eq. ( 11 8t . As a result we get 



N 



,M-N 



; m (diag(\ 



(m) 



(31) 



P({Xi}) = C m ,nMt A(A) 2 J] A 

i=l m 

= C M ^AA T A(A) 2 nAf- w E^r l11 

i=l m " m 

We may now use Weyl's character formula for the unitary groups H31 to represent the characters in terms of the 
eigenvalues of the corresponding matrices. We also use (fTTb to represent a m /d m . As a result of these substitutions 
we get 



TV 



pah}) = c m ,nMt a(a) 2 n E 

8=1 m 

where the vector t was introduced with 



n 

,i=i 



(-i) 



(mi-i + n t )\ 



det (t^- i+w «) det (A™ J - j 
A(t)A(A) 



T -i 



where eigs[T ] is the vector of eigenvalues of T . In addition, 

Cm,n = C m ,n 11^ ~ 1 ) ! 
1=1 

From the form of d32t it is clear that the change of summation variables from mi to ki with 



(32) 
(33) 

(34) 



ki = mi — i + n t 



(35) 



8 



will simplify the notation. Thus the summation over the new variables is over all integer vectors k = [k\, . . . , k nt ], 
with k\ > &/2 > . . . > k nt > 0. The resulting form of the joint probability distribution P({Xi}) is 



N 



P({A,}) = C m ,nMt A(A) 2 [] Af" 7V (-l) 



m t (n t -l)/2 



E 



i=i 



n 

.t=i 



A:;! 



det#) det(A>; 
A(t)A(A) 



(36) 



Step 2: Resummation of series. The second step in calculating P({Aj}) is to re-sum the series in (136b . Despite 
the apparent complexity of this multiple sum, it is in precisely of the form of the Cauchy-Binet formula, described 
in Lemma |3] Applying this formula with w(k) = {—l) k /k\ and therefore W(x) = exp(— x) to d36l > we obtain the 
joint eigenvalue density of eigenvalues of GG^ for the case of semicorrelated zero-mean channels: 



p({\ i })=c M , N n^A(\) 2 ii\ 



N det 

M-N /_-Qn t (n t -l)/2 



A(A)A(t) 

Step 3: Allowing for M > N. The final step in the calculation is to reassign Xj = Xj for j 



1, 



Xj = for j = N + 1,...,M. 

1) Case nt = N < M = n r : In this case d37t is essentially the final result with Xj = Xj, for j = 1 
After some cancellations we have 



P{{\}) 



in 



\ (n r -n t + j - 1)! 



A(A)(-l) n '( n '- 1 )/ 2 



det 



-t t x, 



Mt) 



(37) 

,N and 
...,N. 

(38) 



2) Case n t = M > N 



n r : 



To take the limit Ajv+i 5 ■ • • , Am — > we use Lemma |6] which is an application of 



the l'Hospital rule and appears in Appendix ITTT1 Applying this lemma to d37t with fi(x) 
probability density P({Aj}) for n t > n r 



-tiX 



n r i n t d e t 

p a^}) = n 7i n ^ a(a)(-i)«-^- 1 )/ 2 — 

j=l ■>■ i=l 



1 j ti i 



. j.n t -n r 
... , 



^ ■ p~ tiXi . . p —tiX n 
, o ..... c- 



Aft) 



we get the joint 



(39) 



Clearly, if we interchange the correlated side from transmitter to receiver, one only needs to interchange n r with 
nt in the above equations, where again ti are the eigenvalues of T _1 , see <33t . 

Summarizing the results of this section, we have calculated the joint eigenvalue distribution for GG^ with G 
being semicorrelated, using a single unified approach, rather than two independent methods used before, for the 
two limits n t > n r [35] and n r > n t [17]. 3 In addition, our method, based on the character expansion (Lemma 
|2j), the Cauchy-Binet formula (Lemma |3), and the generalized l'Hospital rule (Lemma |6j is simple enough to be 
applied to other types of problems, as we shall see below. 

To obtain the moment generating function g(z) one needs to integrate over the A's, see (fTTb . From this point one 
can employ the methods by [17], [18] to calculate the moment generating function. For completeness, we include 
the calculation of g(z) in Appendix IIVI for both cases above. 



C. Case of uncorrelated G with non-zero mean Go 

We will now analyze a simple case, where both unitary integrals are present in (1241 . namely the case of 
uncorrelated G with non-zero mean Go- In this case the probability density p(G) takes the form 



p(G) 



exp 



-Tr{(G-G ) t (G-G )} 



exp 



Tr{GSudiog(/i)Vt} +lr{vdm 5 (/i)tutGo} 



(40) 
(41) 



where 7« for i = 1,...,Nq are the non-zero eigenvalues of GqGo and where we have used the singular value 
decomposition of G, J20t , 

Step 1: Integration over U, V. It will be convenient below to have the integrations of both U and V over 
the same unitary group, therefore we will extend both Go and G in the second line in d4Tt tolfxM matrices, 



We note that had [17] applied Lemma |S| to their result, they would also had been able to get both nt > n r and n t < n r results. 
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by adding an appropriate number of columns or rows, which will eventually be set to zero. Essentially what we 
are doing is allowing initially for M non-zero eigenvalues of GG^ for the terms in the exponent and later, if 
appropriate (M ^ N), letting the extra ones go to zero. Inserting d4*Ut into (1241 we get 

No N 



, M—N^—Xi 



(42) 



P({Xi}) = C M , N A(Xf J] e-T' J] [A 

i=l i=l 

DV J DV exp [Tr{G t Udm5(/i)V t } +Tr{vdiac/(/x) t U t G } 

We may now apply the character expansion Lemma |2] to each of the exponentials of traces above. As a result, the 
second line in the equation above becomes 

f DV f DV E am(l)cw (l)Xm (cJUdmsQ^Vt) Xm , (vdiag(fi)^G ) (43) 

•> mm' 

Since both V and U appear as quadratic forms in the double sum, we may apply (fT8l to integrate both unitary 
integrals. d43t thus becomes 



E 



:i) 



d 2 



Xm (diag(\) 



E 

k L 



r nf=i(j-i)! 



n 2 



det( 7 * J ) det(A^) 
A(7)A(A) 



(44) 



where in the right hand side of the above equation we applied Weyl's character formula substituted for a m 
using Mil and redefined the summation in terms of k d35l . Once again we introduced the M-dimensional vector 
A with elements Aj = Aj for i = 1, . . . , N and (for M > N) Ajv+i, • • • , Am arbitrary at this point. In addition, we 
introduced another M -dimensional vector, 7, such that 7$ = 7$ for i = 1, . . . , Nq and, (assuming Ao < M) for 
i = Nq + 1, . . . , M it is currently arbitrary, but will eventually be set to zero. 

Step 2: Resummation of series. We next apply the Cauchy-Binet formula, Lemma [3] to resum the above sums, 
using w(k) = l/kl 2 and the Taylor expansion of the modified Bessel function [36] 

00 -. 

V*) = E ! 



r x -\2k 



k=0 



(k 



!V2 



(45) 



d44l thus becomes 



Inserting this into d42l we obtain 



M 

no- 



l)! 2 - 



det \I Q {2^ i X j 
A( 7 )A(A) 



(46) 



n^[y-i)!(jv+j 



^({Ai}) 



AH 



1)!1 ^ 

— A(A) 2 n«- 



N 



n [*r*«-* 



det I J (2^7iA 
A(7)A(A) 



(47) 



Step 3: Allowing for M > A^ and M > Nq. As in the previous section, we may now let 7, = ji for 
i = 1, . . . , Nq and Aj = Aj for i = 1, . . . , N and if M > N and/or M > Nq we may apply Lemma |6] with the 
parameter xq = to let the superfluous 7^ for i = Nq + 1, . . . , M and Aj for i = N + 1, . . . , M go to zero. The 
final expression for P({Aj}) reads 



-l)(No+N)(M-l) A ( A ) detL 



TV 



iVc 



AH 



A( 7 ) 



ik a n 



"7i . 



t-iM-N 

lL=i 



(N+i-iy. 
(i-iy. 



Ul^ [(j 



=1 J=l 1Aj=1 
where it is assumed that in the last ratio the numerator is unity when M 
when M = Nq. L is a M x M matrix with elements 



I)!] 2 Utilf- N ° 



(48) 



L 



J o(2A/7iAj) 


i < N 


j 


< N 


y-JVo-l 
j-JV-l 


i > Af 


j 


< N 


i < N 


j 


> A" 


((i-^0-l)!) 2 *i-iVoJ-JV 


i > N 


j 


> AT 



N, while the denominator is unity 



(49) 



This obtains the result by [15] using simple Bessel functions rather than more complicated hypergeometric functions, 
with only a few simple steps involved. Using this result, we calculate g(z) in Appendix IIVI 
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D. Case of fully correlated G with zero mean 

In the previous sections we recovered already known results, albeit in a very direct and simple way. We now turn 
to a problem that has not been previously solved. In this section we will analyze the joint probability distribution 
of eigenvalues of G^G when G has zero mean but is fully correlated, i.e. its probability density is 



p{G) = A/t,r exp [-TrjGtT^GRT 1 } 



with 



det T n " det R nt 
Applying (12 1 b to (l50l and then inserting it into (l24l > we get 

P({Xi}) = N t ,rCm !N A{\) 2 J DXJ J DVexp [-Tr [t^V diag (fi^K^V diag (^U^ 
= AA t ,rA(A)J(T,R,A) 



(50) 
(51) 



(52) 



where the second equation defines the quantity Z(T,R, A). At this point it is convenient to assume that M = N, 
relaxing the constraint later. 

Step 1: Integration over U, V. We may now use Lemma |2] to expand the exponential so that the integration 
over U, V can be performed. As a result we get 



Z(T,R,A) = C m ,nA(X)J2 



a, 



.(-1) 



-Xr 



-,-1 



Xr 



(A) Xm (R^ 1 



(53) 



where a m is the expansion coefficient, see (U7L and d m is the dimension of the m representation, see H2t . 
Comparing the above equation to ( 13 H . (l44l we see a crucial difference here, namely that a m , d m do not appear 
with the same powers. This is important because, using (fT7l we can no longer cancel d m , which as seen (fT2l is a 
rather complicated function. Applying Weyl's character formula d 1 5I > and redefining the summations from m to k, 
where ki = — % + M similar to d35l we get 



T(T, R, A) = V 

Ml ^ 



M 

n 



hi 



det(^) det(A^) det(rfi 
A(k)A(t)A(r) 



(54) 



where we also used Lemma ffl to express d^ in terms of A(k) and we introduced the vectors t, r, with elements 
the inverse eigenvalues of T, R, respectively, i.e. 



diag [eigs 
diag (eigs 



T 
R 1 



(55) 



Unfortunately, we may not make any further progress in resumming the above expression, due to two impediments. 
First, to apply Cauchy-Binet's formula, Lemma[51 we need two determinants of the form det(x^ J ) in the numerator. 
Instead we have three. In addition, the extra A(k) in the denominator does not allow us to separate the different ki 
sums. However, we can still get the moment generating function for the mutual information g(z), by first integrating 
over A,-. This will be done in the next section. 



V. Moment-Generating Function for G fully correlated 

Even if in Section IIV-DI we were not able to find an final expression for the joint probability of eigenvalues of 
G^G for the case of fully correlated G, in this section we will generate a closed-form expression for the moment 
generating function of the mutual information, g(z), using d54l i. 
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A. g(z) for the case M = N 

From (11 Q g(z) can be written (for M 



N) as 



. M 

j d\H(i + \iyp({\ t }) 



(56) 



where the integral is over positive Aj's, and P is the joint probability distribution function of the eigenvalues. 
Inserting d54l l into \52l g(z) can be written as 



9(z) 



T,R 



M!A(t)A(r) 



M 



dXlKl + XifAiX) 



k 



M 

n 



(-1) 

hi 



det(tf )det(rf 3 )det(A i Kj ) 
A(k) 



(57) 



A(A) 



(1 + Aj)] 



(58) 



Step 2: Integration over {Aj}. As mentioned above, we cannot resum the series in the form <57t . Instead, we 
will first integrate over A,. To proceed, it is useful to transform the Vandermonde determinant A (A) into another 
one with arguments Aj/(1 + Aj) 

i>j 

■ M 

.m=l 
' M 

ri( i + A 

_m=l 

■ M 

ri( i + A 

ra=l 

The usefulness of this form of the determinant will become apparent in a moment. We first expand the above 
determinant using sgn(b) as in ©. Thus g(z) becomes 




9(z) 



r M 

gn(b) J dX jn(H-Ai) 



2+M- 



M 



(59) 



We now wish to integrate the above expression by parts with respect to the A's, integrating the curly brackets and 
differentiating the square brackets. Usually, when integrating by parts we generate various boundary terms evaluated 
at A = and A = oo. For example, if we integrate by parts p times over a particular A, we will get a sum over p 
terms, in which the nth term (with n = 1, . . . ,p) has the square brackets differentiated n — 1 times with respect to 
this A and is evaluated at A = and A = oo. However, here we will point out that these boundary terms are zero 
in the cases we are concerned with. 

We note that according to Lemma |5] in Appendix ITT1 the quantity X(T,R, A) is exponentially convergent when 
any Aj approaches infinity. Therefore, the square bracketed quantity above, as well as any of its derivatives with 
respect to the A's is zero when evaluated at any Aj = oo. As a result, we may discard all generated boundary 
terms at Aj = oo. In addition, we observe from its definition in i54l that X is finite when any of the A's approach 
zero. Thus if we integrate each \ up to hi — 1 times by parts in d59l >. all boundary terms evaluated at Aj = 
for i = 1, . . . , M, will vanish, since they involve derivatives of the square-bracketed quantity above, not exceeding 
bi — 1 order. Indeed, we will choose to integrate by parts precisely 6j — 1 times in order to make further progress. 

To proceed with this multiple integration by parts, we first note that the (pi — l)th indefinite integral of the curly 
brackets in (l59l with respect to all Aj is 



M 

n 



t=iU b ;=xiz+M-p) 



(60) 



with the product in the denominator assumed to be unity for bi = 1. To obtain the (6, — l)th derivative of the 
square brackets in d59b we need to rewrite X as an expansion in the form of (I54t . In particular, using sgn(c) (see 



12 



©) to expand det(A^ J ) in i54t . we can rewrite the square brackets of iS% as 



Ml 



E 



Z (-i) fc - 



A(k)A(t)A(r) V P() 3 !! j 



(61) 



where the sum over c is over all permutations of (1,2,..., M). By differentiating all A, in the above expression 
bi — 1 times we get 



1 

M! 



n 

k Li=l 



E 



fe;! 



det(^)det(^) it 
A(k)A(t)A(r) V g W M 



p=i 



fe 3 -i 



Combining all above results, 5(2;) of (l59l can be expressed as 



TR 



M! 



,1/ 



dA Il( 1 + Ai 



,2 + M-l 



1=1 



E 

k 



det(tf ) det(rf') 
A(k)A(t)A(r) 



(62) 



(63) 



ill 



6,-1 



(-l)^-iA^ TT 

v ; 3 l \ z + M -p 
p=i 1 



x E s § n ( b ) s § n ( c ) n 

b,c j=l 

where the additional (— comes from the integration by parts. 

We will now integrate over the Aj's before summing over the k's. Looking at the above equation this seems 
problematic since each of the terms in the sum over k above is unbounded at large A. However, from Lemma |5] in 
Appendix |H] we know that X and thus the integrand above is exponentially bounded at large A and hence integrable. 
To circumvent the discrepancy we introduce a cutoff function /(A) for every A, which is unity at A — > and tends 
to zero faster than a power law as A — > 00. For example, we can take f(x) = e~ Sx . In the end of the calculation 
we will set 5 = and thus make f(x) unity for all x. Cutting off the integrals makes all terms finite and thus we 
can freely interchange the order of the summation and integration. Thus we have 



Afr 



TR 



Ml 



E 



det(t^) det(rj*) 
A(k)A(t)A(r) 



(64) 



M 



bi-1 



5>gn(b)s g n(c)n (-1) 6 *- 1 n 



k Ci +P 



b,c 



i=l 



\z + M-pJ 



\z+M-l 



Next it is important to appreciate the following property of the second line of the above equation. Once the integrals 
over A, have been performed, the quantities in the curly bracket depend on the index i only through the indices 
bi and Cj. Therefore, the second line of the above equation is in the form of (0 and thus it can be written as Ml 
times a determinant of the matrix W with elements 



kj +p 



i-l 

n z^-s-^r^ 



dAA fc ^l + A) z+M - 1 /(A) 



(65) 



XJ ; z + M -p 

where the second equation results from the fact that the only i-dependence of Wij is outside the integral. Thus 
Wij is simply the integral above and can be factored outside the determinant, resulting in 



M 



detW = JJWydet 



i-l 



n 



kj +p 



\z + M -p 



1) 



i-l 



("I) 



M 

Af(M-l)/2 J~J 



Xi 



det 



"(fcj+z-1)! 
kjl 



(66) 



(67) 



In d67l) . the power of (—1) comes from bringing (— outside the determinant, while the product of (z+j — 1) J_1 
originates from the denominator in the determinant. 
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Using the same procedure as in Lemma [2 in Appendix |]J we can show that the determinant as in the right-hand 
side of (|67ll is simply equal to (-l) M ( M - 1 )/ 2 A(k). We can thus rewrite the second line in d64l as 



M 



(z+i-iy- 1 

Inserting this into d64l we see immediately that A(k) cancels leaving 



00 dx { -^(i + xy +M - 1 f(x) 
kj\ 



(68) 



(69) 



i= i {z + i _ iy -i^ A(t)A(r) 

Step 3: Resummation of series. We may now use the Cauchy-Binet theorem (see Lemma [3Jl to the above form 
of the sum over k, which gives 

/•CO 

/ dAe- r ^ A (l + Af +M -7(A) 
Jo 

TitiXn i \\z+M— 1 



det 



(70) 



det 

M 

n 



d\e 



\i + xy 



det F{r{tj] z) 



(71) 



In the second line above we discarded the function f(x), since the integral is now convergent. I.e., we now take 
the limit f(x) = lims^oe~ Sx —> 1. Thus, F(x,z) is given by [37] 



F(x, z) = x M / dXe~ xX (l + \y+ M ~ 1 = x - z e x T{z + M, x) 
Jo 



(72) 



where F(a, x) is the incomplete T function. In summary, the moment generating function of the mutual information 
g(z) can be written for n t = n r = M = N as 

g(z) = det FiUrj ; z) (73) 

where the normalization A/t,r has been cancelled with the term in front of the determinant in (TTTb . We note that 
the above expression is symmetric in T and R for M = N. 

B. g(z) for the case M > N 

Step 4: Allowing for M > N. Next, we will obtain the expression for M / N, and specifically, without loss 
of generality we will assume that m > n r , i.e. M = nt, N = n r . This can be readily obtained by first assuming 
that R is A/ -dimensional with the first N eigenvalues the ones corresponding to the correlations at the receiver 
and then taking the remaining M — N eigenvalues to 0. Thus the M / N problem is obtained by letting the 
M — N eigenvalues of one of the correlation matrices go to zero. In the particular situation, we will let rj, for 
j = N + 1 , . . . , M go to infinity. 

Even though both numerator and denominator of ( 173! are unbounded in this case, using Lemma in Appendix 
|ffl]we can see that the ratio is well-defined and finite. Specifically, we see that the function fi(xj) in the Lemma 
is F{tiTj,z). Noting that the asymptotic expansion of F(x,z) is (see [37], Equation 8.357) 



we find that when rjv+i. 



det F{tiTj; z 
A(r M ) 



F(x,z)^x M ~ l J2f 

k=0 

oo the ratio becomes 

M-N-l 



r(M + z) 

(M + z- k)' 



\l {M + z-j) 



M-N-j 



3=1 



detL 
A(r]vT 



(74) 



(75) 



where we used the notation r^r, tm to explicitly specify the length of the vectors and L is a Af -dimensional matrix 
with elements 

/ F{t irj -M + z) j<N 



L 



'j 



t 



j>N 



(76) 
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Summarizing all results above, we thus have the general form of the moment generating function g(z) of the 
mutual information I for general M, N which is 

i \ -rp I" zl~\ Il? = i N -HM + z - detL 
q(z) = Eq, e = — - 77 . . . t . , (77) 

with the elements of L given by ( l76t for n t > n r and with ti and r« interchanged for n r > n t . 

VI. Ergodic Capacity for fully correlated channels 

In this section we will derive an expression for the ergodic capacity in the case of fully correlated channels 
discussed in the previous section. Without loss of generality, we will assume that n r = N < M = nt, as is 
implicitly assumed in f77b . We apply © and differentiate g(z) in d77t with respect to z: 

9(z)= 9 (z)^Y { M + Z _ J -g7T7 + ditL^^J (78) 
We next have to set z = 0, to get £?[!], as in (JHJl, recalling that g(0) = 1, see Q. This results to 

M-AT-l • AT , ,f 

3=1 J J=l 

where the matrix Lj is the matrix L with its j-th column replaced by the column D\^j, i.e. 

Lj = [Liy, Li t 2] • • • ; Lij^i; Di^f, Aj+i; • . .] (80) 
where the element Djj is given by 

dF(tirf,M + z 




(81) 

2=0 



'Ei(-x)' 


(M-l)\ 


X 





where the superscript (M — 1) in the square brackets denotes the M — 1st derivative and Ei{x) is the exponential 
integral [37]. Note that the second sum in d79l goes up to N, since for j > N the columns in L do not depend on 
z, as seen in d771 ). Similarly one can calculate higher moments of the mutual information distribution. 

A. Example 

We will now apply the ergodic capacity equation dT^t derived above to evaluate the mutual information for the a 
simple case of MIMO correlated channels. Specifically, we will use the following simple model for the dependence 
of the correlation coefficient T a b on the antenna separation and angle spread. For concreteness, we assume that 
the antennas form a uniform linear ideal antenna array with d\ = d m i n /X the nearest neighbor antenna spacing in 
wavelengths and we assume a Gaussian power azimuth spectrum (with 2 dimensional propagation), i.e. the average 
incoming power at the antenna array is P{9) oc exp[— (6/5) 2 /2], [38], [39] where 5 is the angle spread in degrees 
measured from the vertical to the array. This results in a T matrix with elements 

T b = [ 18 ° — c 2TTi(a-b)d x sm(cl>TT/l80)~<t> 2 /(28 2 ) ^2) 

a J -iso V2ttS 2 

with a,b = 1 . . . n t being the index of transmitting antennas. Two situations will be considered: First, where the 
same angle-spread appears in both transmitter and receiver, and second where only the transmitter has the above 
correlations, with the receiver having uncorrected antennas. For simplicity, we assume that the transmitted signal 
covariance is unity, that the noise is uncorrelated and that the signal to noise ratio per transmitter antenna is unity. 
In this case, one can evaluate the eigenvalues of the correlation matrices and insert them in the appropriate capacity 
equations. Fig. ^ shows the comparison of the analytically obtained ergodic mutual information for fully correlated 
and semicorrelated channels with simulations. In Fig.|2we plot the outage mutual information obtained for a number 
of representative cases as calculated numerically using (TTOb and compare them with Monte Carlo simulations. It 
should be stressed however, that results obtained analytically are exact, and we only include Monte-Carlo simulations 
as a cross-check. 
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Fig. 1. Plot of mutual information for a n t — 4-transmitting-antenna and a n r -receiving-antenna array pair for n r = 3,4. Both transmitter 
and receiver arrays are uniformly spaced with dmin spacing between neighboring antennas. The solid and dashed curves have correlations 
between elements on both antenna arrays given by 1821 with angle spread 5 = 10°. Once the eigenvalues of the correlation matrix are 
evaluated, they are inserted in 1791 . which gives the ergodic mutual information. The receiving array of the other two curves (dotted and 
dot-dashed) has no antenna correlations. In this case i 1081 in Appendix IIV- Al is used to calculate the mutual information. We see that at 
large antenna separation the antenna correlations become negligible and the correlated cases approach the semicorrelated ones. The asterisks 
represent the values of the simulated average capacities and have been added as a cross-check. 



VII. Conclusion 

In conclusion, we have demonstrated a promising method recently derived in the field of representations of Lie 
groups, which can be used to calculate complex integrals over unitary groups. We applied this technique to recover 
in a simple way the joint probability distribution of eigenvalues of the matrix GG^, with G being semicorrelated 
or having a non-zero mean, which can then be used to calculate the moment generating function of the mutual 
information for such Gaussian MIMO channels. In addition, we used this method to calculate the moment generating 
function of the mutual information for fully correlated channels. From this moment generating function we obtained 
the ergodic average of the mutual information and evaluated the outage probability. We believe these methods are 
general and applicable to a wide range of multi-antenna communications problems. As an example, in Appendix 
13 we also analyzed their application to unitary encoded space-time transmission of MIMO systems, calculating, as 
a particular example, the received signal distribution when the channel matrix is correlated at the transmitter end. 
We expect there will be further applications of these techniques in the future. 
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Fig. 2. Plot of the outage mutual information for a nt = 4-transmitting-antenna and a n r -receiving-antenna array pair for n r = 3, 4 and 
two different array spacings d m i„. The other parameters used are the same as in Fig.Q The curves correspond to the values of probability 
versus outage capacity, which were obtained by numerically integrating 1 101 . The asterisks result from Monte-Carlo simulations and have 
been added as verification of the numerical integral. 



Appendix I 
Proof of Lemma [T] 



Proof: We start by writing d m as 



M 



det 



(h-M + j)\ 



using dT4b . The i,j -th element of the matrix in the determinant can be expanded to 

ki(ki-l)...(ki-M + j + l) 



(83) 



(84) 



For example, the Mth column has elements equal to unity, the (M — l)th column has elements equal to ki, etc. 
We note that the product in (l%4l in fact vanishes when ki < M — j, as noted after (fL^t . As a result, the above 
expression is valid for every i,j and ki. If we expand the above expression in powers of ki we get 



M-j 

E 

P =i 



M—j-p+l 



(85) 



with the coefficient a\j = 1 for all j = 1, . . . ,M and all other coefficients independent of i. We now subtract 
aM-j,jki, i.e. a multiple of column M — 1 from all columns j = 1, . . . , (M — 2). This operation leaves the value 
of the determinant invariant, but reduces the upper limit in the summation in d85l to M — j — 1. We may now 
continue this process of subtracting off a p jk i ~ 3 ~ p+ for p = (M — j — 1), (M — j — 2), ... ,2 until the only 



17 



term appearing in the ijth term is kf' 1 J . This is exactly the form of the Vandermonde matrix, with columns j and 
M — j + 1 interchanged. Thus by reshuffling the columns to get the usual ordering, and thereby picking up a term 

(_l)M(M-l)/2 ; W£ ^jyg tQ thfi form of ,TJJ B 

Appendix II 

In this appendix we will show that the quantity X(T, R, A) defined in d52l as 

T = C M A(A) J DU J DVexp [-Tr |T" 1 U(im 5 (/x)V t R" 1 V^a 5 (/x) t U t }] (86) 

is exponentially small when any of the A's becomes arbitrarily large. To prove this we start by showing that 
Lemma 4: For any two M-dimensional positive definite matrices A, B 

Tr{AB} > a m ,; n Tr{B} (87) 

where a m i n is the minimum eigenvalue of A. Equality holds when A is proportional to the unit matrix. 

Proof: Let aj., . . . , clm be the eigenvalues of A and Ua be the unitary matrix diagonalizing A, and b±, . . . , bu 
and Ub the corresponding eigenvalues and unitary matrix for B, i.e., A = UA^ag(a)U A and B = UBcfaa<?(b)U B . 
Then 



Tr{AB} 



(88) 



> 



Tr jdm 5 (a)U A U B (imc/(b 1/2 )) (u A U B dmc/(b 1/2 ))) * 

M M 

E°*E \u\Vvdiag(h 1 ' 2 

i=i j=i 

M M 

0"min 

i=l j=l 

= a min Tr|u A BU A } 

= «minTr{B} 

where diagih 1 / 2 ) is a diagonal matrix with diagonal elements b l J 2 , for i = 1, . . . , M. The inequality holds since 
the term multiplying each a« is non-negative. One gets an equality only if A is proportional to the unit matrix. ■ 
We now apply this result to the trace in the exponent of d86b 

Tr{T- 1 Udm5(/^)V t R- 1 V(im 5 (/i) t U t } > t min Tr {U(im 5 (/i)V t R- 1 Vdfac/(/x) t U t J (89) 

= t min Tr{R" 1 V(iia 9 (A)V t } 
> t min r min Tr {diag(X)} 

where t m i n and r m j„ are the minimum eigenvalues of T" 1 , R^ 1 respectively (or the inverses of the maximum 
eigenvalues of T, R, which are assumed non-zero). As a result, 

T t,r,A ^ C M A(A) j DU J DVexp[-t r , 

M 



'"mill' ruin 



Tr {diag(X)}] 



(90) 



C M A(A) exp 



-t 



mm' mm 



E A * 



i=l 



Since t m i n r m i n > we finally have 

Lemma 5: The quantity X(T,R, A) in d86l) is bounded by an exponential function of any \ when for any 
i = 1, . . . , M, X{ becomes arbitrarily large. 
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Appendix III 
Limit of Ratios of Functions 

This appendix concerns limits of ratios of the form 

det[fj(xj)] 



R([xi, . . . x M \) 



A([xi, . . . x M ]) 



(91) 



when several xfs become equal. In the above equation, i,j = 1, . . . , M, A(x) is the Vandermonde determinant 
and det[fi(xj)] is the determinant of an M x M matrix whose i,j'-th element is fi(xj). 



Lemma 6: When two or more of the Xj's in d9Tl ) become equal, then both numerator and denominator are zero. 
However, the ratio is generally well defined. In particular, 



lim R([xi,. . . x M \) 

Xi,X2,—,X p —*X 



(92) 



fi(xo); f; '(xq); ...;/> ; (x ); fi(x p+ i); fi(x p+2 ); ■■■] fi(x M ) 



TP" 1 i\ 



A(x p+1 . . . x M ) Uii P+1 (xi - x )p Ylj=i 3 ! 

where represents the k th derivative of the function /. (For clarity, we separate the columns by semicolons, 
and note that the different rows of the determinant correspond to different values of i = 1, . . . , M). 

Proof: Proof will be by induction on p. The theorem is trivial for p = 1. Suppose the theorem holds up to 
some given p. Then we would like to take the limit as x p+ \ — ► xq. Examining the matrix of the numerator, we 
write the elements of the p + l'st column in a Taylor expansion 



fi(x 



/<+0 = H 

3=0 



#(*o) 



(x p+ i - x y 



(93) 



We will now use the fact that you can subtract any multiple of one column from another column without changing 
the value of the determinant. Since the j th column is ff\xo) we can thus subtract off the first p terms of this sum 
(j = 0. . .p— 1) without changing the determinant, leaving us with the p + l'st column given by 



/iCsp+iW^Czo) 



(xp+i - x ) p 



pi 



+ higher powers of (x p+ i — xq) 



(94) 



We now examine the denominator in i92t . Again, we assumed that the theorem holds for some given p. Thus, as 
we take x p+ \ — ► xq the denominator becomes 



M 



lim A(x p+ i . . . x M ) I| ( x * ~ x o) 

l=p+l 



p-l 

n 



(95) 



(Xp+l - So) 



A(x p+2 . . . x M ) n ~ x ° 

i=p+2 



p-l 
3=1 



Y[ (Xi-Xp+i) 
i=p+2 

Plugging in the results of d95l > and ( 194b into \92\ proves the lemma for p + 1. 

A related result can be shown for the case when some elements of the vector x go to infinity: 
Lemma 7: If the asymptotic behavior of fi{xj), when Xj —> oo is of the form 



fi{ x j 



fc=0 



(*)--* 



then we have the following limit 



lim i?(si, . . . xm) 

Xi,X 2 ,..., x p — +oo 



det 



/i (0) ; • • • ; f\ K> ; fi(x P +i); /i(s p+2 ); • • • ; /iM 



?(*). 



A(x p+1 



■ xm 



(96) 



(97) 



19 



Appendix IV 

Moment generating function for semicorrelated and non-zero mean channels 

In this appendix we use the joint probability distribution P({Aj}) derived in Sections ll V-B I and II V-C I to calculate 
the corresponding moment generating function g(z), ill lb . We start with the following Lemma. 
Lemma 8: Let F({Aj}) be a function of Aj, for i = 1, . . . , N, defined as 

N 

F({A i })=/Cn/(Ai)A(A)detP(A) (98) 

i=i 

where /C is a constant independent of Aj and the M x M matrix P, with M > N, has elements 

P .._/MA;) 3<N 

for some function pi(x). Then integrating F over A we have 

Yl d\jF({\i}) = N\ det P (100) 

3=1 ° 

where the M x M matrix P has elements 

Proof: We expand the two determinants as in (|2li resulting in 

I]/ ^({A,}) = ^^sgn(a)sgn(b)n / dXJi^K^PbXK) \{ q ibi (102) 

j=l a b i=l J ° i=N+l 

N M 

= ee^^ii^^ n 9ib, 

a b i=l i=7V+l 

M 

= AT!^sgn(b)n^ 

b i=l 

= AHdetP 

■ 

We now apply this lemma to the two cases of semicorrelated and non-zero mean channels analyzed in Sections 
ITV^ and lr\7-Cl 

A. Moment generating function for semicorrelated channels 

We can immediately apply the above lemma to calculate the moment generating function. We distinguish two 

cases: 

1) N = nt < n r = M: In this case, the form of the matrix P simplifies, since there are no qij terms (see i38l ) 
and we thus have /(A) = (1 + A) z A n,_nt and Pj(X) = e~ tjX , resulting in 

j( Z ) = (-l)-M)/>^^ (103) 

with the n t x nt matrix L has elements 

Lij = ^(n r - n t + j,n r - n t + j + z + 1, U) (104) 

where ty(a,b,z) is the confluent hypergeometric function [37], given by 

1 r°° 

*(o, b, z) = —— / dte' tz t a - 1 (l + t) b - a - 1 (105) 

r(o) Jo 

Note we absorbed the product of factorials in d38l into the ^ function. 



20 



2) M = n t > n r = N: In this case we use (B9l and set /(A) = (1 + X) z , qij = t\ TLr 1 and Pi(X) = e 



resulting in the following expression for g(z) 



j(z) = (-irM)/»g^^ 

i=i ' 



with the n t x rit matrix L has elements 
L 



*(rt r - n t + j, re r - n t + j + z + 1, ij) j >n t - n r 



(106) 



(107) 



Ergodic Capacity: By employing methods similar to Section [VI] the expression for the ergodic mutual 
information of semicorrelated channels, which is analogous to d79l and in agreement with [17], [18] 



N 



i=i 



detL 



(108) 



where L is given by (1103b or (I106t evaluated at z = and Lj is given by the matrix L, with the elements of the 
column j + max(0, nt — n r ) substituted by 



Di 



f—UM-rtt+j 

(M-nt+j -1)1 



e u Ei(-ti 



(M-TH+j-l) 



(109) 



with the superscript in the last term indicating the number of derivatives applied. 



B. Moment generating function for channels with non-zero mean 

Applying Lemma [S] to ( PUfl ) we obtain the following expression for g(z) 



yiM—N 

5(z) = ( _ 1) (^o + iV)(M-l)iiri_ 



c*— i)i 



i)!] 2 5 1 



detL 
AW 



L is a M x M matrix with elements 



/ °° dXXi-^l + A) 2 e- A / (2^A) i<N ,j<N 
{i + j-N - + j -N -l,i + j-N + z,l) i> N ,j < N 

lt N ^ i<N ,j>N 



[(i-N -l)\Y6i- No)j - N 



i > N ,j > N 



(110) 



(111) 



Appendix V 

Other directions: Application of Unitary Integrals to Unitary Space-Time Encoding 

In this section we will briefly show another application of integration over unitary groups, where the character 
expansion and other results analyzed above may be applied, namely in the field of unitary space-time encoding. 
This is a scheme suggested recently by [24]-[27], in which the signal is encoded using unitary matrices across 
multiple transmit antennas and time-slots. The interest in this encoding method lies in its very good performance 
for large signal to noise ratios in the absence of any channel information at the receiver. [27] 

In particular, we assume that the channel matrix G is constant over T time-intervals and then changes completely. 
In this case the channel equation © needs to be modified to [24] 



Y = GX + Z 



(112) 



where G is a n t x n r matrix with the channel coefficients from the transmitting to the receiving arrays, Z is 
the n r x T additive noise matrix, while Y and X are n r x T and n t x T dimensional matrices representing the 
output and input signals. The noise elements Z^j are assumed to be Gaussian i.i.d. with unit variance, while G 
is semicorrelated at the transmitter end, i.e. E[Gi a Gjb] = &ifE a b- Their instantaneous values are assumed to be 
unknown to both the transmitter and the receiver. 
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Recently [27] analyzed the mutual information of this problem, when the input distribution X is unitary, i.e. 
when 



xxt 



(but X^X / It)- One can thus express X as 



X 



Jn t U 



(113) 



(114) 



where U is an T x T unitary matrix and J nt is a nt X T matrix with J a b = S a b- (II 14t includes only n t out of T 
orthogonal T-dimensional vectors. To obtain the capacity of such encoding structures, it is essential to first calculate 
the received signal probability distribution, which involves an integration over all transmitted signals, which in this 
case are unitary matrices. 

Since both G and Z are Gaussian matrices, the conditional distribution of Y on X, p(Y|X) will also be Gaussian, 
with density 



exp 



p(Y|X) 



-Tr < Y 



I T + XtTX Yt 



n Tn r det(I T + XtTX)^ 
exp (-Tr{Y [l T - Xtj-^x] Yt}) 



■K Tn - det(I„ t + T)^ 

where the second equation results from X being unitary, (^OJ. The received signal distribution is 



(115) 



p(Y) 



[27] obtained a closed form expression for the case T a b 
closed form expression for a general positive definite T. 
We can rewrite the above integral over X using (II 141 as 



dXp(Y|X) (116) 
S a b using different methods. Here we will obtain a 



p(Y) 



DU 



exp ^— Tr {Y [l T -Utfu] Yt}) 
7r Tn - det(I nt + T) n - 



where 



(117) 



(118) 



We immediately see that the integral in ( II lit is of the form of (I29t . which allows us to immediately obtain the 
result. We are interested in the case T > n t . [27] Defining the T-dimensional vector t so that U = t j = Tj/(1 + Tj) 
for i = 1, ... , nt, where Tj are the eigenvalues of T, and using d37t we have 



p(Y) 



n?=ip! 



det (e^A 



n"ii(l + ^-) A(y)A(t) 



rrexp(-^yi 



(119) 



where y is the T-dimensional vector of eigenvalues of YtY. We now need to take t{ — > for i = n t + 1, . . . , T. 
Also, if T > n T we need to set yi = for i = n r + 1, . . . , T. These limits can be taken by using Lemma |6] The 
final result is 



p(Y) 



detL 



n^i(i + rj) nr^" 1 ii A( y ) n£i yr Q A(t) n -4i ^- nt 

where Q = min(T, n r ) and the T-dimensional matrix L has elements 

e v ' tj i < Q,j < n t 

vl nt ' X i<Q,j>n t 
i> Q,j <n t 



ex p(- H 



(120) 



i=l 



Li 



(121) 



!)!&. 



i-n r ,]-n t 



i> Q,j > n t 
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